Propose pattern for pipelines with multiple/optional correction steps #211

SimonHeybrock · 2025-05-01T05:41:09Z

This is a suggestion for a pipeline design pattern, addressing recurring needs for selecting one or more optional correction steps.

The notebook is a draft for gathering feedback on the idea. Will be cleaned up if positive.

See also #192, which I hope is not necessary if the suggested pattern works out.

jl-wynen · 2025-05-02T07:30:06Z

Nice idea. But this will lead to an exponential growth of functions in the number of corrections. This sounds like a maintenance problem.

We could mitigate this slightly by using something along the lines of

def apply_corrections(corrections: list) -> Callable[...]:
    # just a sketch, doing this requires unpacking the corrections list into separate arguments:
    def apply(raw_data: RawData, *corrections): return ....
    return apply

But this way, it would not be (easily) possible to find the actual function from the graph viz or JSON.

SimonHeybrock · 2025-05-02T07:36:32Z

Nice idea. But this will lead to an exponential growth of functions in the number of corrections. This sounds like a maintenance problem.

I doubt there can be too many that are all applied at the same stage of a pipeline? What is the largest number you have encountered?

We could mitigate this slightly by using something along the lines of
def apply_corrections(corrections: list) -> Callable[...]:
    # just a sketch, doing this requires unpacking the corrections list into separate arguments:
    def apply(raw_data: RawData, *corrections): return ....
    return apply
But this way, it would not be (easily) possible to find the actual function from the graph viz or JSON.

Not sure I understand how your list approach works.

jl-wynen · 2025-05-02T07:44:35Z

I doubt there can be too many that are all applied at the same stage of a pipeline? What is the largest number you have encountered?

Just 1 correction with 3 options in powder: the 'run normalisation'. In reflectometry, there are more corrections. @jokasimr How many are there?

We also have the pixel masks with an arbitrary number of masks. Can we also handle them with this approach instead of having a dedicated function that people have to apply to their pipeline?

Not sure I understand how your list approach works.

I was just brainstorming. But essentially:

wf.insert(apply_corrections([apply_a, apply_b]))

where apply_corrections([apply_a, apply_b]) creates and returns a function of the form

def apply(raw_data: RawData, correction_a: CorrectionA, correction_b: CorrectionB) -> CorrectedData:
    return apply_b(apply_a(raw_data, correction_a), correction_b)

SimonHeybrock · 2025-05-02T08:04:14Z

I doubt there can be too many that are all applied at the same stage of a pipeline? What is the largest number you have encountered?

We also have the pixel masks with an arbitrary number of masks. Can we also handle them with this approach instead of having a dedicated function that people have to apply to their pipeline?

We usually used map/reduce for them? Are there other cases?

Not sure I understand how your list approach works.

I was just brainstorming. But essentially:
wf.insert(apply_corrections([apply_a, apply_b]))
where apply_corrections([apply_a, apply_b]) creates and returns a function of the form
def apply(raw_data: RawData, correction_a: CorrectionA, correction_b: CorrectionB) -> CorrectedData:
    return apply_b(apply_a(raw_data, correction_a), correction_b)

I am not sure you can auto-generate such a function that would work with Sciline?

jokasimr · 2025-05-02T08:34:12Z

Just 1 correction with 3 options in powder: the 'run normalisation'. In reflectometry, there are more corrections. @jokasimr How many are there?

If we are talking about scalar multiplicative corrections (that is, not polarization correction or background subtraction)
I don't think there are more corrections in reflectometry than for other techniques. For ESTIA there are:

Correction by virtual slit opening (essentially "footprint correction")
Correction by monitor (or proton current, or measurement time)
Correction by supermirror reflectivity (only for reference measurement)

If we are talking about "corrections" to coordinates (not to event weights). There might be more of those.

We also have the pixel masks with an arbitrary number of masks.

There are a number of masks applied, more than the number of corrections.

SimonHeybrock added 2 commits May 1, 2025 07:43

Propose pattern for pipelines with multiple/optional correction steps

ac6fcf5

Clarify

124e910

SimonHeybrock force-pushed the optional-correction-chains branch from 1ac947e to 124e910 Compare May 1, 2025 05:43

SimonHeybrock marked this pull request as draft May 1, 2025 05:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Propose pattern for pipelines with multiple/optional correction steps #211

Propose pattern for pipelines with multiple/optional correction steps #211

Uh oh!

SimonHeybrock commented May 1, 2025 •

edited

Loading

Uh oh!

jl-wynen commented May 2, 2025

Uh oh!

SimonHeybrock commented May 2, 2025

Uh oh!

jl-wynen commented May 2, 2025

Uh oh!

SimonHeybrock commented May 2, 2025

Uh oh!

jokasimr commented May 2, 2025 •

edited

Loading

Uh oh!

Uh oh!

Propose pattern for pipelines with multiple/optional correction steps #211

Are you sure you want to change the base?

Propose pattern for pipelines with multiple/optional correction steps #211

Uh oh!

Conversation

SimonHeybrock commented May 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jl-wynen commented May 2, 2025

Uh oh!

SimonHeybrock commented May 2, 2025

Uh oh!

jl-wynen commented May 2, 2025

Uh oh!

SimonHeybrock commented May 2, 2025

Uh oh!

jokasimr commented May 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

SimonHeybrock commented May 1, 2025 •

edited

Loading

jokasimr commented May 2, 2025 •

edited

Loading